(57) Abstract

A polypeptide having immunological activity for use as a diagnostic reagent and/or a vaccine component for the HIV virus.
The polypeptide comprises a substantial portion of each of more than one of the constituent proteins coded for by the HIV-pol
gene, namely HIV-pol protease, HIV-pol reverse transcriptase, HIV-pol RNase H and HIV-pol integrase.

POLYPEPTIDE HAVING IMMUNOLOGICAL ACTIVITY FOR USE AS
DIAGNOSTIC REAGENT AND/OR VACCINE

TECHNICAL FIELD

This invention relates to a polypeptide having
immunological activity for use as a diagnostic reagent
and/or a vaccine component.

BACKGROUND ART

Diagnostic kits for use in screening individuals for
infection with human immunodeficiency virus (HIV)
frequently include reagents comprising HIV antigens which
are used to detect antibodies using known immunological
techniques including ELISA, Western blot, latex
agglutination and immuno-luminescent and immuno-fluorescent
techniques.

The effectiveness of such techniques, however, depends
upon selection of suitable immunological reagents, and one
particular difficulty which arises is that particular
reagents are often specific to individual strains or groups
of strains of HIV. Thus, for example, known diagnostic
reagents based upon HIV-1 may fail to detect antibodies
resulting from an infection of a patient with HIV-2.

Similarly, in the production of vaccines designed to
protect individuals against HIV infection, the use of
antigens derived from one particular strain of HIV may fail
to provide adequate protection against infection with other
strains.

It is an object of the present invention to overcome
such problems.

DISCLOSURE OF INVENTION 
It has now been found that the product of expressing a
substantial part of the HIV-pol gene in a suitable host has
antigenic properties which allow the above-mentioned
problems to be overcome.

Thus according to one aspect of the present invention
there is provided the use as an antigenic reagent in a
diagnostic test or as a vaccine component of a polypeptide
comprising a substantial portion of each of more than one of
the constituent proteins coded for by the HIV-pol gene.

Diagnostic kits and vaccines comprising said polypeptide
form further aspects of the present invention.

The HIV-pol gene codes for four enzymes, namely a
protease, a reverse transcriptase, a ribonuclease referred
to as RNAse H and an enzyme referred to as Integrase.

It is believed that during infection of a T cell by HIV
a full length precursor is expressed which is then cut up
into the discrete proteins listed above. These have the
following activities and (it is thought) act in the order
indicated:

Protease               Precursor cleavage
Reverse Transcriptase  Preparation of viral DNA from viral RNA
RNAse H                Destruction of viral RNA leaving newly
                       synthesised DNA
Integrase              Insertion of said DNA into host cell genome

According to a preferred aspect of the present
invention, said constituent proteins are enzymes coded for
by the HIV-pol gene and the polypeptide thus comprises a
substantial portion of each of a plurality of enzymes
selected from HIV-pol protease, HIV-pol reverse
transcriptase, HIV-pol RNAse H and HIV-pol Integrase. Most
preferably, the polypeptide comprises substantial portions of
all four of said enzymes.

In vivo, the initial product of expressing the HIV-pol
gene is cleaved into its individual elements by the
protease. The active site for proteolytic activity occurs
adjacent the NH2-terminus of the expression product,
corresponding to the 5'-end of the protease gene.

According to a preferred aspect of the present
invention, the polypeptide omits at least that part of the
amino acid sequence encoded by the HIV-pol protease gene
which provides the active site responsible for proteolytic
activity. By omitting this portion, the integrity of the
polypeptide is maintained and it is less liable to degrade.

BRIEF DESCRIPTION OF DRAWINGS

Figure 1 is a schematic diagram showing the procedure of
Example 1;

Figure 2 shows the results of electrophoresis tests
carried out in the manner explained in Example 2; and

Figure 3 is a graph showing the results of the
experiments carried out in Example 3.

BEST MODE FOR CARRYING OUT THE INVENTION

The HIV-pol gene of several strains of HIV-1 has been
cloned and the corresponding amino acid sequences derived
from the determined DNA sequences. The amino acid sequences
of ten strains appear in the accompanying Table 1 at the end
of this disclosure. In Table 1, the full sequence of strain
HIV HXB2 is given, whereas for the other nine strains, only
sequence differences are listed. As used herein, the term
"constituent protein coded for by the HIV-pol gene" refers
to a protein having sufficient amino acid homology with the
sequence of HIV HXB2 appearing in the accompanying Table 1
that antibodies raised against the protein cross-react with
a polypeptide consisting of the precise amino acid sequence
of HIV HXB2.
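The layout of Table 1 (one full reference sequence, with only the differences listed for the other strains) can be sketched in code. The following is a minimal illustration only: the data structure, the function names and the invented differences are assumptions made for this sketch and are not taken from Table 1. The percentage identity it prints is a rough numerical proxy and is not the cross-reaction criterion used above to define a constituent protein.

```python
# Illustrative sketch: rebuild a strain's sequence from a reference sequence
# plus a list of differences, in the spirit of the layout of Table 1.
# All names and the example differences are hypothetical.

def apply_differences(reference, differences):
    """Return a copy of `reference` with the listed substitutions applied.

    `reference` is a list of three-letter residue codes.
    `differences` maps a 1-based position to the residue found in the other
    strain; positions that are not listed are taken to be identical.
    """
    variant = list(reference)
    for position, residue in differences.items():
        if not 1 <= position <= len(reference):
            raise ValueError(f"position {position} is outside the reference")
        variant[position - 1] = residue
    return variant


def percent_identity(seq_a, seq_b):
    """Percentage of aligned positions that carry the same residue."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Toy data: a ten-residue stretch used as the reference and two invented
# differences for an imaginary second strain.
reference = ["Met", "Ile", "Gly", "Gly", "Ile", "Gly", "Gly", "Phe", "Ile", "Lys"]
differences = {2: "Val", 9: "Leu"}

variant = apply_differences(reference, differences)
identity = percent_identity(reference, variant)
print("".join(variant))
print(f"{identity:.1f}% identical to the reference")
```

Storing only the differences keeps the reference untouched, so any number of strains can be expanded from the same reference in this way.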

The HIV-pol gene can be expressed to produce the desired
polypeptide by various techniques, e.g. some or all of the
baculovirus techniques described in U.S. Patent 4,745,051 to
Gale E. Smith et al issued on May 17, 1988; Baculovirus
Vectors for Expression of Foreign Genes by C. Yong Kang,
Advances in Virus Research, Vol. 35, pp 177-192, Academic
Press Inc., 1988; A Manual of Methods for Baculovirus
Vectors and Insect Cell Culture Procedures, Max D. Summers
and Gale E. Smith, May 1987, Texas A&M University; and
Baculoviruses as Gene Expression Vectors, Lois K. Miller,
Ann. Rev. Microbiol. 42, pp 177-199; the disclosures of
which are incorporated herein by reference. However, our
Canadian Patent Application Serial No. 591,908 filed on 23rd
February 1989 (and equivalent British Patent Application
Serial No. 89 04426.7 filed on February 27, 1989 and US
Patent Application Serial No. 316,768 filed on February 28,
1989) describes and claims an improved baculovirus
expression system capable of producing foreign gene proteins
at high levels, and the use of this expression system is
particularly preferred for expressing the polypeptide of the
present invention.

The process disclosed in our Canadian patent application
employs a recombinant baculovirus containing at least a
major part of a polyhedrin gene promoter region, a
transcription termination sequence of a polyhedrin
structural gene, and a foreign structural gene (e.g. an
HIV-pol gene) having a translation start codon followed by
coding sequences and a translation stop codon. The foreign
gene is located between the promoter region and the
termination sequence. Immediately upstream of the start
codon there is a putative insect cell ribosome binding site
for the polyhedrin gene effective for overcoming resistance
of susceptible insect cells to expressing the foreign gene
at a high level. The putative ribosome binding site
comprises at least the final four nucleotides of the
sequence 5'-ACCTATAAAT-3'.
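The requirement that the bases immediately upstream of the start codon include at least the final four nucleotides of 5'-ACCTATAAAT-3' can be checked mechanically. The sketch below is an illustration only; the function names are hypothetical, and the test sequence is the short leader printed at the head of Table 2 (5'-GGATCCTATAAATATG).

```python
# Illustrative check of the putative ribosome binding site described above.
# Function names are hypothetical; only the two sequences come from the text.

RBS_SEQUENCE = "ACCTATAAAT"  # sequence quoted in the text, written 5' to 3'


def rbs_match_length(upstream):
    """Count how many bases at the 3' end of `upstream` match the 3' end of
    RBS_SEQUENCE, comparing base by base from the end."""
    count = 0
    for base, expected in zip(reversed(upstream.upper()), reversed(RBS_SEQUENCE)):
        if base != expected:
            break
        count += 1
    return count


def has_putative_rbs(upstream, minimum=4):
    """True if at least `minimum` final bases of RBS_SEQUENCE sit directly
    5' of the start codon."""
    return rbs_match_length(upstream) >= minimum


# Leader shown at the head of Table 2, ending in the ATG start codon.
leader = "GGATCCTATAAATATG"
assert leader.endswith("ATG")
upstream = leader[:-3]  # bases immediately 5' of the start codon

matched = rbs_match_length(upstream)
print(f"{matched} of {len(RBS_SEQUENCE)} bases matched")  # 9 of 10 bases matched
print(has_putative_rbs(upstream))                         # True
```

In this leader the BamHI site (GGATCC) supplies the two C residues, so nine of the ten bases are present, which is more than the four-base minimum stated above.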

Example 3 of the Canadian application describes the
production of the pol protein of HIV-1 in a baculovirus
expression system based on Autographa californica
nucleopolyhedrosis virus (AcNPV) and specifies that a
recombinant baculovirus designated AcNPV-HIV-YK-pol has been
deposited at the American Type Culture Collection of 12301
Parklawn Drive, Rockville MD 20852, USA under Accession No.
ATCC VR 2233. Deposit was made on November 30, 1988. The
disclosure of our Canadian Patent Application Serial No.
591,908 is incorporated herein by reference.

Utilising the procedures described in Example 3 of
Canadian Patent Application Serial No. 591,908, a polypeptide
comprising the protease, RNAse H and Integrase enzymes of
HIV strain HIV HXB2 may be produced.

The polypeptide can be used as a diagnostic reagent or
vaccine component in ways known to persons skilled in the
art, e.g. by the techniques indicated in the publication
entitled Clinica, Testing for HIV and AIDS, The Next
Five Years, George Street Publications Ltd., Richmond,
Surrey, UK, the disclosure of which is incorporated herein
by reference.

The invention is illustrated in more detail by the
following Examples. Example 1 illustrates the production of
a modified recombinant plasmid pUC18-Dpol 3 having a 273 bp
deletion at the 5'-terminus and its expression as a
polypeptide lacking the first 91 amino acids at the
NH2-terminus of the HIV-pol protease. Examples 2 and 3 relate
to the expression of the polypeptide and its use as a
diagnostic reagent.

EXAMPLE 1 

Construction of baculovirus transfer vector containing HIV-1
pol gene with 273 bp deletion at 5' terminus

As illustrated in Figure 1, the BglII and SalI fragment
of plasmid pHXB-2D containing the HIV-1 pol coding region
was isolated and inserted into the BamHI and SalI sites of
pUC18. The resulting recombinant plasmid (pUC18-Dpol 1) was
cut with SstI and dephosphorylated. A synthetic
double-stranded crossover linker containing an SstI cohesive
end, a BamHI site, the putative insect Spodoptera frugiperda
(SF9) cell ribosome binding site (P) and 15 nucleotides of
the homology searching sequences which overlap with the 5'
terminus of the pol gene was ligated at the SstI site and
transformed. The recombinant plasmid (pUC18-Dpol 2) was
isolated, digested with SphI, dephosphorylated and ligated
with another crossover linker DNA containing an SphI cohesive
end at the 3' terminus, a BamHI site and 15 nucleotides of
the homology searching sequences which recognise the 3'
terminus of the pol gene. The resulting recombinant plasmid
(pUC18-Dpol 3) contains the putative SF9 cell ribosome
binding site (P) followed by the pol open reading frame
starting with the first ATG (TI) codon (map unit 2357-2359)
in the pol gene and ending with the translation termination
(TT) codon TAG (map unit 5093-5095). This whole cassette was
flanked by BamHI sites. The BamHI fragment was isolated and
inserted into the BamHI site of the pAcYM1 baculovirus
transfer vector (pAcYM1-Dpol). The pAcYM1-Dpol transfer
vector DNA was used to co-transfect SF9 cells with wild type
AcNPV DNA to isolate recombinant AcNPV HIV-YK pol virus.
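The figures quoted for this construct can be cross-checked with simple arithmetic. The sketch below uses only the numbers given in the text (the 273 bp deletion and the map units of the ATG and TAG codons); the average residue mass of 110 Da is a common rule of thumb introduced here as an assumption, so the resulting mass is a rough estimate and not a measured value.

```python
# Arithmetic cross-check of the figures quoted for the truncated pol construct.
# Variable names are illustrative; the 110 Da average is an assumption.

DELETION_BP = 273                 # deletion at the 5' terminus (from the text)
START_CODON = (2357, 2359)        # first ATG used, map units from the text
STOP_CODON = (5093, 5095)         # TAG stop codon, map units from the text
AVERAGE_RESIDUE_MASS_DA = 110.0   # assumption: rule-of-thumb average mass

# A 273 bp deletion removes a whole number of codons, so the frame is kept.
deleted_residues, remainder = divmod(DELETION_BP, 3)

# Length of the open reading frame, counting both end positions.
orf_nt = STOP_CODON[1] - START_CODON[0] + 1
codons = orf_nt // 3
residues = codons - 1             # the stop codon encodes no amino acid

approx_mass_kd = residues * AVERAGE_RESIDUE_MASS_DA / 1000.0

print(f"deleted residues: {deleted_residues} (remainder {remainder})")
print(f"open reading frame: {orf_nt} nt, {residues} residues")
print(f"approximate mass: {approx_mass_kd:.0f} kD")
```

The deletion corresponds to exactly 91 codons, in agreement with the 91 amino acids stated above, and the estimated mass of roughly 100 kD is of the same order as the uncleaved band of about 95 kD reported in Example 2.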

EXAMPLE 2

Expression of pol gene products by recombinant baculoviruses

Recombinant AcNPV-HIVWHpol contains an insert comprising
essentially the whole DNA sequence of the HIV-pol gene (see
Table 2 at the end of the present disclosure). When
expressed, the resulting full length gene product of the
HIV-pol gene is "processed", i.e. the proteolytic active
site of the HIV-pol protease cleaves the protein into
66 kD, 51 kD and 32 kD fragments.

By way of comparison, recombinant AcNPV-HIVYKpol (see
Table 3 at the end of the present disclosure) omits
NH2-terminal amino acid sequences containing the proteolytic
active site of the HIV-pol protease. When expressed, the
resulting gene product is not "processed", i.e. the ~95 kD
protein remains intact.

The following experiments illustrate this.

Uninfected S. frugiperda (SF9) cells, or SF9 cells
infected with recombinant baculoviruses AcNPV-HIVWHpol,
AcNPV-HIVYKpol or with wild-type AcNPV, were harvested after
72 hours of infection. Lysates of the infected or uninfected
cells were electrophoresed in a 12% polyacrylamide Laemmli
gel and proteins were identified by either Coomassie blue
staining (S) or Western blot analyses (W) using the standard
NIH HIV positive immunoglobulin. As shown in Figure 2, lanes
1, 2 and 3 represent the lysates of AcNPV-HIVWHpol
recombinant virus infected cells, lanes 4, 5 and 6 represent
the lysates of AcNPV-HIVYKpol recombinant virus infected
cells, lane 7 shows the wild-type AcNPV infected cell
lysate, lane 8 shows uninfected cell lysate and lane 9 shows
molecular weight markers. Lanes 3 and 6 show the whole cell
lysate, lanes 2 and 5 show proteins in the infected cell
nuclei and lanes 1 and 4 show proteins in the infected cell
cytoplasm. P denotes polyhedrin protein and arrows show the
95 kD uncleaved pol gene product, carrying the 91 amino acid
deletion of the protease, produced by AcNPV-HIVYKpol virus
and the 66 kD, 51 kD and 33 kD processed pol gene products in
AcNPV-HIVWHpol virus infected cells.

EXAMPLE 3 

A. Production of pol gene product

Recombinant AcNPV-HIVYKpol virus infected Spodoptera
frugiperda (SF9) cells were harvested 4 days after
infection. Nuclei of infected cells containing most of the
pol gene product were isolated by treating the infected
cells with 0.1% Triton X-100 and 0.5% NP40 on ice for 20
minutes followed by centrifugation at 750 g for 10 minutes.
The pelleted nuclei were denatured with 1% SDS in Tris-HCl
pH 8.0 at room temperature for 30 minutes. The cellular DNAs
were removed by ethanol precipitation using 2 volumes of
100% ethanol. The SDS in the solution was removed by
addition of 25 mM KCl and incubation at 4°C for 30 minutes
followed by centrifugation at 12,700 g for 15 minutes. The
pol gene product in the supernatant was used for anti-pol
ELISA.

B. Detection of HIV antibodies by ELISA

The pol antigen was diluted in PBS and dispensed in a
microtiter plate (Nunc cat 269620). The concentration of pol
to coat plates was determined empirically on the strength of
bands on polyacrylamide gels.

The amount of pol necessary to coat one well was
between 1 and 10 µg.

The plate was covered and incubated at 4°C. The time of
incubation varied between 12 and 24 hrs with no apparent
differences in reactivity.

The plates were then washed three times in PBS-Tween 20
employing a Skatron plate washer.

Various standards, NIH HIV+ immunoglobulin (NIH STD),
pooled HIV+ plasma (PAT STD) and plasma from non-infected
individuals (NS) were employed. The standards were diluted
beginning at 1:200 for NIH STD, and 1:10 for PAT STD and NS.
Unknowns were usually tested at 1:50 but dilutions as
concentrated as 1:10 can be employed.

All samples were inactivated before testing. Normal sera
were processed in the same fashion as sera from AIDS
patients. The inactivation was performed with
4'-aminomethyltrioxsalen hydrochloride (AMT) from Lee
Biomolecular Research Inc. (San Diego, California cat 231)
and an ultra violet light trans-illuminator (Spectroline
model TC-365, Fisher Scientific Ottawa Ont.). The AMT was
reconstituted in 50% ethanol at 1 ng/ml. The sera were
aliquoted in Eppendorf tubes and for every 100 µl of serum
or plasma, 10 µl of AMT was added to the sample. The
samples were laid in the transilluminator and irradiated
for 5 minutes. An additional 10 µl of AMT was added to the
sample and the samples were irradiated for a further 5
minutes. The samples were inactivated by this procedure.

The incubation time of the human anti-pol was 30 to 40
minutes at room temperature (23°C) (the time of incubation
was found to be quite critical). Therefore, all dilutions of
standards (negative and positive) and unknowns were performed
in a separate plate. Once all dilutions were done, the
dilutions (100 µl) were transferred to the ELISA plate
coated with pol employing a multichannel pipettor. All
dilutions were with PBS-Tween 20 (0.1%).
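The volumes needed to prepare the starting dilutions in the separate plate follow from simple proportions. The sketch below is an illustration under two stated assumptions that are not in the text: a ratio written "1:N" is read as one part sample in N parts total, and a working volume of 200 µl per dilution well is chosen so that 100 µl can be transferred. The function name is hypothetical.

```python
# Illustrative helper for preparing the starting dilutions in a separate plate.
# Assumptions (not from the text): "1:N" means 1 part sample in N parts total,
# and each dilution is made up to a working volume of 200 microlitres.

def dilution_volumes(dilution, final_volume_ul):
    """Return (sample_ul, diluent_ul) for a dilution written as '1:N'."""
    parts_sample, parts_total = (int(x) for x in dilution.split(":"))
    if parts_sample <= 0 or parts_total < parts_sample:
        raise ValueError(f"not a valid dilution: {dilution}")
    sample_ul = final_volume_ul * parts_sample / parts_total
    return sample_ul, final_volume_ul - sample_ul


WORKING_VOLUME_UL = 200  # assumption, leaves 100 ul to transfer

# Starting dilutions quoted in the text for each sample type.
starting_dilutions = {
    "NIH STD": "1:200",
    "PAT STD": "1:10",
    "NS": "1:10",
    "unknown": "1:50",
}

plan = {
    name: dilution_volumes(ratio, WORKING_VOLUME_UL)
    for name, ratio in starting_dilutions.items()
}

for name, (sample_ul, diluent_ul) in plan.items():
    print(f"{name}: {sample_ul:g} ul sample + {diluent_ul:g} ul PBS-Tween 20")
```

Preparing every dilution before any is transferred allows all wells to start their timed incubation together, which matters because the incubation time was found to be critical.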

The state of the serum or plasma sample was found to be
important. Samples repeatedly frozen and thawed usually gave
higher backgrounds. This was especially evident with samples
from normal individuals.

The plates were washed three times in PBS-Tween 20 after
the 30 minute incubation with the first antibody.
A Skatron II plate washer was employed for this purpose.

The second antibody used (goat anti-human Ig linked to
horse radish peroxidase) was an affinity purified reagent
obtained from Tago Diagnostics (Inter Medico To DNT cat
2393). An appropriate dilution, determined experimentally
(approximately 1:2,000), was made in PBS-Tween 20
(0.1%). 100 µl was dispensed into the wells except for one
which was employed as a blank for the plate reader. The
plate was incubated for 1 hour at room temperature.

The plates were washed three times with PBS-Tween 20
employing the Skatron II plate washer.

Freshly prepared substrate (100 µl) was added to the
wells and after 20 minutes the reaction was stopped with the
addition of 100 µl of 0.07M H2SO4.

The plate was read at 450 nm in the BIOTEK BL/310 ELISA
plate reader. A hard copy of the data was obtained from the
reader and the data were also stored directly onto computer
diskette for further processing by the Anelisar program.

Additionally, controls were performed on each
plate. In two or three wells no serum or plasma was added.
In one well no primary or secondary antibodies were added
but substrate was. This well was employed to blank the ELISA
plate reader. The remaining wells were employed to determine
the extent of binding of the secondary antibody (goat
anti-HIg-HRPO) to pol. Thus, these wells received no primary
antibody but did receive secondary antibody and substrate,
with the appropriate washes in between each incubation.
Usually the value of this latter control is below 0.1000 OD.
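The handling of the two plate controls can be sketched as follows. This is an illustration only: in the procedure above the reader itself is blanked on the substrate-only well, whereas here the subtraction is done in software, and every optical density shown is an invented example value, not a result from Figure 3. The well labels and function names are hypothetical.

```python
# Illustrative treatment of the plate controls described above.
# All readings are invented example values; names are hypothetical.

SECONDARY_CONTROL_LIMIT = 0.1  # OD; the text states the control is usually below 0.1000


def blank_corrected(readings, blank_od):
    """Subtract the substrate-only blank from every raw reading."""
    return {well: round(od - blank_od, 4) for well, od in readings.items()}


def secondary_control_ok(control_ods, limit=SECONDARY_CONTROL_LIMIT):
    """True if every no-primary-antibody well stays below the limit."""
    return all(od < limit for od in control_ods)


# Hypothetical raw readings at 450 nm.
blank_od = 0.050                          # no antibodies, substrate only
raw = {
    "no_primary_1": 0.110,                # secondary antibody and substrate only
    "no_primary_2": 0.095,
    "sample_A": 0.452,
}

corrected = blank_corrected(raw, blank_od)
controls = [corrected["no_primary_1"], corrected["no_primary_2"]]

print(corrected)
print("secondary antibody control acceptable:", secondary_control_ok(controls))
```

A plate whose no-primary-antibody wells exceed the limit would point to binding of the secondary antibody to the coated antigen itself and would warrant repeating the run.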

The results are shown in Figure 3. 

The following materials were used for the anti-pol ELISA
procedure:

Buffers

Phosphate Buffered Saline (PBS)

Na2HPO4 (dibasic anhydrous)    13.6 g
NaH2PO4 (monobasic)             2.4 g
NaCl                           90.0 g

Salts are dissolved in 8 litres of distilled deionized
water and pH is adjusted to 7.2 with NaOH or HCl. This
buffer is employed as coating buffer, diluent and washing
buffer. The latter two buffers are modified as indicated
below.

Diluent for primary and secondary antibodies and washing
buffer

PBS + 0.1% Tween 20 (Sigma, St. Louis MO) (0.1 ml Tween
20 + 100 ml PBS). The diluent buffer is made up daily.

Substrate buffer

Equal volumes of 0.1M Na2HPO4 (0.709 g/50 ml) and 0.1M
citric acid (0.960 g/50 ml). The pH is adjusted to 4.0 with
NaOH or HCl. The substrate buffer is made up weekly.
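The masses quoted for the two 0.1M stock solutions can be checked against molar masses. The sketch below assumes the anhydrous forms of both compounds (141.96 g/mol for Na2HPO4 and 192.12 g/mol for citric acid); the text does not state which hydrate was used for the substrate buffer, so that choice is an assumption, made because it reproduces the quoted masses.

```python
# Cross-check of the masses quoted for the 0.1 M substrate buffer stocks.
# Assumption: both compounds are the anhydrous forms.

MOLAR_MASS_G_PER_MOL = {
    "Na2HPO4": 141.96,       # anhydrous disodium hydrogen phosphate
    "citric acid": 192.12,   # anhydrous citric acid, C6H8O7
}

QUOTED_MASS_G = {            # grams per 50 ml, as given in the text
    "Na2HPO4": 0.709,
    "citric acid": 0.960,
}


def grams_needed(molarity, volume_ml, molar_mass):
    """Mass of solute for a solution of the given molarity and volume."""
    return molarity * (volume_ml / 1000.0) * molar_mass


calculated = {
    name: grams_needed(0.1, 50, molar_mass)
    for name, molar_mass in MOLAR_MASS_G_PER_MOL.items()
}

for name, grams in calculated.items():
    print(f"{name}: calculated {grams:.4f} g, quoted {QUOTED_MASS_G[name]:.3f} g")
```

Both calculated values agree with the quoted masses to within about a milligram, which is consistent with the anhydrous forms having been intended.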
Substrate

A tablet (2 mg) of o-phenylenediamine (Sigma cat. P6787)
is dissolved into 10 ml of substrate buffer. Hydrogen
peroxide (4 µl of 30%) is added to the solution just prior
to plating. The solution should be kept in the dark as much
as possible.

Stopping reagent

The enzymatic reaction is stopped with 0.07M H2SO4.

It is a particularly advantageous feature of the
polypeptides, the use of which is described herein, that
they cross-react with antibodies against diverse strains of
HIV. Thus, for example, the polypeptides described herein
based on HIV-1 can cross-react with antibodies raised
against various strains of HIV-1 and HIV-2. Thus they may be
used in diagnostic kits for detecting either virus category.
Similarly, in vaccines they can provide broad-spectrum
protection.

Industrial Applicability

As will be apparent from the above, the present
invention can be used in the medical field for testing for
HIV infection and for immunizing against HIV infection, as
well as for other diagnostic or prognostic purposes.



TABLE 1

HIV-1 pol protein sequence of HIVHXB2 virus
Data from Human Retroviruses and AIDS 1988
Los Alamos National Laboratory

AcNPV-HIVWHpol

HXVIIXB2 Met PheP heArgGluAspLeuAlQPheLouGlnGlvLvsAlQArqGluPh*>t;^r>^£>,-f;i .. .a 

HivBHioa ^ _ 

— -Gin 20 



HIVBH5 
HIVPV22 
HIVBRU 
HIVmN 
HIVSF2 
HIVRF 
HIVMAL 



-Gin 20 

-Gin 20 

-Gin 20 

. 0 




19 

-Leu 19 

Pro 19 



HIVELI Asn Pro Gly Leu ProLys i g 



HIVHXB2 : 

HIVBH1 02 ThrArgAlQAsnSerProThrlloSarSorGlu 

HIVBH5 ThrArgAloAsnSorProThrlleSerSerGlu 
HIVPV22 ThrArgAlaAsnSerProThrlleSerSerGlu 
HIVBRU ThrArgAlQAsnSerProThrlleSorSerGlu 



GlnThrArgAlaAsnSerProThrArg 20 



HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



i»0 

: — 40 

— 40 

' 0 

28 

^ 28 

-Ser 28 

Ser 28 



HIVBH102 ^'"^^^"^®"^^"^°^"^'"P^^^^'"5^^P^*"^^"^®'"^''°^^'"^^"^°^^VAlaAspArg ^Q 



HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 



HIVMAL 
HIVELI 



HIVSF2 GlyGlu Leu 

HIVRF 



SO 

60 

—-Leu 60 

0 

i*8 



-Arg — Gly . . , LysThrLeu Thr ~-Glu U7 

•Arg — , , .ProLeu LysThr Glu 47 



HIVHXB2 GlnGlyThrVolSerPhcAsnPheProGlnVQlThrLeuTrpGlnArgProLeuValThr 68 

HIV6H1 02 —————— ^ 

HIVBH5 



HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 



HIVELI 



-II© 50 



— He— 



80 



He Qg 

He Qo 



He 68 



HIVRF Sor —He zie B7 

HIVMAL He Ser He ^vol- G7 



Alo 57 






<- gog cds end 

HIVHXB2 IleLysIleGlyGlyGlnLeuLysGluAloLeuUeuAspThrGlyAloAspAspThrVal 



HIVBH102 
HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



Arg- 

Val 



VolArgVal— 



88 
100 
100 
100 
100 
0 
88 
87 
87 
87 



HIVBH102 
HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



AcNPv-HIVYKpol starts 
HIVHXa2 LeuGIuGli^atS^rLeuProGlyArgTrpLysProLysMetlleGIyGlylleGlyGly 



• •••■•••a — — Asn*"— Apg— — — 



108 
120 
120 
120 
120 
17 
108 
107 
107 
107 



HIVHXB2 

HIVBH102 

HIV8H5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



PhelleLysValArgGlnTyrAspGlnllGLeuIleGIuIleCysGlyHisLysAlalle 



"Thi Gly- 

-ProVal— 



128 
1<»0 
140 
1'»0 
140 
37 
128 
127 
127 
127 



HIVHX82 GlyThrValLeuVoIGlyProThrProValAsnllelleGXyArgAsnLeuLeuThrGln 
HIVBH102 



HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



— Ile- 



-Met- 



148 
160 
160 
160 
160 
57 
148 
147 
147 
147 






\/ p6S. p51 



HIVBH102 
HIVBH5 
HIVPV22 



HIVMN Leu- 

HIVSF2 

HIVRF 



PpoGlyMetAspGlyProUysValLysGlnTrpProLeuThrGl 



HIVSF2 



KIVBH1D2 
HIVBH5 
HIVPV22 



-Asp Arg 



HIVHXB2 
HIVBH1D2 



HIVPV22 
HIVBRU 
HIVMN 



HIVELI . 



180 
160 



HIVSRU " 

T eo 



77 
168 



HIVMAL 2IZ ~Z IZ~~' '^^'^ 

HIVELI ^^"^ 



167 



uGliiLy^IleLysAlo 1 88 



HIVHXB2 

HIVBH102 

HIVBH5 ~I " 

HIVPV22 II~Z"~ ~ 

HIVBRU ~ ZIIZI II 

HIVMN " 200 



97 



HIVRF 

HIVMAL 
HIVELI 



HIVMAL . Arg I~~~ "^^^ 



187 
187 



h?^r!;?L '•^"^^^^^"^^^^^^^^'"^^"'^^"^"^VsGluGlyLysIlcSerLysIleGlyProGlu 208 



220 
220 



HIVBRU ~" _ 220 

HIVMN He I—Iimi II"I 

HIVSF2 — ^ II ~ "J^ mini 

HIVRF ' " 2t>8 

HIVMAL —Thr LysAsp I-LeuIIIIIIIIIIIIZ' HI 

HIVELI Thr Asd 





AsnProTyrAsnThrProVolPheAlQlleLysLysLysAspSerThrLysTrpArgLys 228 

HIVBH5 HI 2i»0 

2i>0 



2^0 
2<»0 



HIVSF2 II_I I" ~ ''^^ 

HIVRF : I ~~ ' * 228 

HIVMAL " 227 

227 

227



HIVHXB2 LeuVolAspPheArgGluLeuAsnLysArgThrGlnAspPheTrpGluVolGlnLeuGly 2kB 

HIVBH102 260 

HIVBH5 Arg ^ 260 

HIVPV22 ' 260 

HIVBRU 260 

HIVMN Lys 157 

HIVSF2 2aa 

HIYRF 2U7 

HIVHXB2 IleProHisProAloGlyLeuLysLysLysLysSGrValThrValLouAspValGlyAsp 268 

HIVBH102 ^ 280 

HIVBH5 280 

HIVMN 177 

HIVSF2 268 

HIVRF 257 

HIVMAL 267 

HIVELI 267 

HIVHXB2 AlaTyrPheSerValProLeuAspGluAspPheArgLysTyrThrAlaPheThrllePro 280 

HIVBH102 300 

HIVBH5 ^- 300 

HIVPV22 300 

HIVBRU 300 

HIVSF2 L^s 288 

HIVRF — LysGlu 287 

HIVMAL 287 

HIVELI Ser 287 

HIVHXB2 SerlleAsnAsnGluThrProGlylleArgTyrGlnTyrAsnVQlLeuProGlnGlyTrp 308 

HIVBH102 320 

HIVBH5 SerGly 320 

HIVPV22 320 

HIVBRU 520 

HIVMN 217 

HIVSF2 T 308 

HIVRF Arg 307 

HIVMAL 307 

HIVELI 307 






HIVHXB2 LysGlySerProAlQllePhoGlnSerSGrMotThrLysIleLeuGluProPheArgLys 328 
HIVBH102 Lys 3i»0 



HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



3^*0 

3i»0 

237 



Lys 327 

Thr 327 

327 



HIVHXB2 GlnAsnProAspIloValllGTyrGlnTyrMGtAspAspLeuTyrVolGlySerAspLeu ZUQ 

HI VB HI 02 — -•— ~— — — — ^ ^ ^ ^ 

HIYBH5 * ~~ 

HIVPV22 

HIVBRU ~ "~" 

HIVPON "~ ~ ' 3 " 

HIVSF2 ZIIIII I 

HIVRF Glu 1 ZJl 



HIVMAL Lys Glu— 

HIVELI GluMet 



HIVELI 



HIVBHS Phe 

HIVPV22 



HIVBRU 

HIVMN Phe— 

HIVSF2 Phe— 

HIVRF Phe— 

HIVMAL Phe- 

HIVELI Phe Arg 



360 
360 
360 
360 
257 
3i*8 
3if7 
347 
3i*7 



HIVHXS2 GluIlGGlyGlnHisArgThrLysIleGluGluLeuArgGlnHisLeuLeuArgTrpGly 368 

KIVBH5 

HIVPV22 

HIVBRU II 

HIVMN Ala 

HIVSF2 



380 

380 

Arg 277 

HIVRF Tie 7^ ! 

HIVMAL — t • 

— — Glu Uys 367 

Lys Glu 367 



HIYBH102 ^®"^^'""^*'''''2^^^P*-y^*-y^"^^^^"*-Y^^^"P'"^P'"OP^eLeuTrpMetGlyTyrGlu 388 



A00 
400 
400 
400 
297 
388 
387 
387 
387



HIVHXB2 LeuHisProAspLysTrpThrValGlnProIleValLeuProGluLysAspSerTrpThr kOB 

HIVBHie2 4^20 

HIVBH5 lie . 420 

HIVPV22 420 

HIYBRU 420" 

HIVMN 317 

HIVRF 407 

HIVHXB2 ValAsnAspIleGlnLysLeuValGlyLysLeuAsnTrpAloSerGlnlleTyrProGly i^2B 

HIVBH5 440 

HIVBRU 440 

HIVMN Ala 337 

HIVELI Asn GluArg 427 

HIVHXB2 IleLysValArgGlnLeuCysLysLeuLeuArgGlyThrLysAloLeuThrGluVallle UkS 

HIVBH102 460 

HIVBH5 460 

HIVPV22 460 

HIVBRU 460 

HIVSF2 Lys 448 

HIVRF Lys Vol 447 

HIVMAL Lys Ala AspIloVal 447 

HIVHX62 ProLeuThrGluGluAloGluLeuGluLeuAloGluAsnArgGluIleLeuLysGluPro 468 

HIVBH102 480 

HIVBH5 480 

HIVPV22 480 

HIVBRU 480 

HIVMN , 377 

HIVSF2 468 

HIVRF Gin Lys ^ 467 

HIVMAL Ala - 467



HIVHXB2 ValHisGlyVQlTyrTyrAspProSerLysAspLeuIleAlQGluIleGlnLvsGlnGlv 
HIVBH102 ^ ' 



HIVBHS 
HIVPV22 
HIVBRU 
HIVMN 



HIVSF2 Glu- 

HIVRF 

HIVMAL 

HIVELI 



AGS 
500 
500 
500 
500 
397 
488 
i»87 
A07 
ii07 



HIVHXB2 GlnGlyGlnTrpThrTyrGlnlleTyrGlnGluProPheLysAsnLouLysThrGlyLys 

HIVBH5 ~Z nil I ""~ ~ 

HIVPV22 zz~ m ~ * 

HIVBRU ~~i nil n 

HIVMN mill II " 2 " 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI His 



-GlnTyr- 



508 
520 
520 
520 
520 
U17 
508 
507 
507 
507 



HIVHXB2 TyrAlaArsMetArgGlyAlQHisThrAsnAspVQlLysGlnLouThrGluAlaValGln 

HIVBH102 — 

HIVBH5 ^ II II_II I~ ~ J] 

HIVPV22 1 II Iim. " 

HIVBRU imi ii_ "I n ~n 

HIVMN 1 ~~ ~ m 

HIVSF2 . IIII~" 

HIVRF 
HIVMAL 
HIVELI 



— IleLysSer 



Ala 



528 
5<»0 
5<f0 
540 
540 
437 
528 
527 
527 
527 



HIVHXB2 LysIleThrThrGluSerlleVolIleTrpGlyLysThrProLysPheLysLeuProIle 

HIVBH1 02 

HIVPV22 iiiiiimn n I ii 

HIVBRU ~ III"" '_\ " I "~I " ~ 

HIVMN Ala II"" " * _I 

HIVSF2 ValSer Xle ™ 

HIVRF YQlAlo _ " I" 

HIVMAL AlaGln ~Ar-III"Z~I_I 

HIVELI Arg~.Ser ^rg nArg_~II_. 



548 
560 
560 
560 
560 
457 
548 
547 
547 
547



HIVHXB2 GloLysCluThrTppGluThrTrpTrpThrGluTyrTrpGlnAloThrTrpIXeProGlu 5S8 



HIVBH5 

HIVPV22 

HIVBRU 



Alo 



580 
580 
580 



HIVMN Thr+++ I 

HIVSF2- Alo Mot ~" ''^^ 

HIVRF Ala 

HIVMAL 
HIVELI 



566 
567 
567 
567 



HlI!Bm02 l^lll'Z^^^^^ 588 



HIVBH5 



HIVBH5 

-Ser — 



HIVBRU 



HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 



-Asp SerlleAla — 

Asp ser- 



600 
600 
600 



HIVPV22 

HIVBRU ~" 

HIVMN Vol 

HIVSF2 

HIVRF IIIIIIIIII! 

HIVMAL ^ 



h97 
588 
587 



HIVELI IIIIIIIIllliiiiiii~i~~"~2Iz!!!lz — I III 

"^b!S?02 ^IZlt""^^^^^^ B08 



620 



HIVPV22 620 



HIVMN 620 

Lys 517 



HIVSF2 

HIVRF He I ~ 608 

HIVMAL _ 607 

HIVELI He L 607 

607 

H?Sm02 tlltll'^^'^'^lZ^^^^^^ 628 

Lys 



640 

His 640 

640 
640 

Ser 

628 



HIVMAL Aso t 

.HIVELI Tzl ~ 1^"^ 

Asp



H?^Bm02 tllltZt^^^^^^ SUB 

HIYBH5 His~™IIIII IZ~II 

HIVPV22 ~ 660 

HIVBRU . "-His III "ZZII 

HIVMN His~~ I " ' 660 

HIVSF2 ~" " 557 

HIYRF ^j^g 6U8 

HIVMAL "wis"" ^^-^ 

HIVELI Asn~~II~IIIIIIII~~^^'' ^""^ 



Hli;Bm02 tllllll^^^^^ G68 
HIVBH5 IIIIIZII Uys 600 



HIVPV22 



-Lys 680 



HIVBRU _ 680 

HIVMN ~ "-ys 680 

Hrvsr2 II ~ I — 

HIVRF ; ~*, 

HIVMAL Lys 667 

HIVELI "~ 

Lys 667 

wIJb^iL °^"^""°^"'-^"^°^*="gl"I^«XleGluGlnLeuIleLYSLysGlaLYsVolTvrLeu 600 

HIVBH5 1~ I ~ ~~ "700 

HIVPV22 1~ 700 

HIVBRU Z~„II III "700 

HIVMN sei milllllllll I~II 

HIVSF2 ZZIII * * -IIIH 

HIVRF : ser III I" 

HIVMAL iio ~ 

HIVELI r ~ ^^"^ 

607 



UJ^BmL tlll7Z°^^^^^ 708 

HIVBH5 IIII II~I_ " 720 

HIVPV22 IIIIIZII. 

HIVBRU IIIZZ mil II 

HIVMN zz ZZZZZZZ. 

HIVSF2 1 617 

HIVRF " 221 '^^^ 

HIVMAL Ser HZ I' "ZZ ^"^^ 

HIVELI _Z L.



HIVHXB2 AlQGlylleArgLysValLeuPhoLeuAspGlyiaoAspUysAlaGXnAspGluHisGlu 728 

HIVBH5 lie Glu IIIII yJJ 

HIVPV22 II© 

uT\/aoti 

7£fO 

GluAsp 637 



HIVrviN 

HIVSF2 Asn ^-Glu 

HIVRF Thr 

HIVMAL Ser 

HIVELI Gin 



728 

727 

-727 

Glu 727 



HIVHXB2 LysTyrHisSerAsnTrpArgAlaMetAlaSerAspPhGAsnLouProProValVolAla 748 
HIVBH102 



HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 



760 

760 

760 

760 

-lie 657 

748 



747 

lie 747 

747 



HIVHXB2 LysGluIleValAlaSerCysAspLysCysGlnLeuLysGlyGluAloMetHisGlvGln 768 
HIV8H1C2 



HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



780 
780 
780 
780 
677 
768 
767 
767 
767 



HIVHXB2 ValAspCysSerProGlylleTrpGlnLeuAspCysThrHisLeuGluGlyLysVallle 788 
HIVBH102 



800 

QJJg 

800 

800 

697 

He 787 

He 787 

787 






HIV8H5 IIZIIII ZZ. 

HIVPV22 ~I 

HIVBRU IIIZL. 

HivMN ZZZ IZZI Z'Z z 

HIVSF2 ' ZZZ ~ '. 

HIVRF ~~ ~ 

HIVMAL He Z I. Z 

HIVELI ZZZ 



808 
Q2C 
320 
820 
820 
7T7 
808 
807 
aC7 
807 



nlltmlz °^y°""'^^"^^^*^°-ryrPheLeuLeuLysLeuAloGlyAr9TrpProValLv 



HIVBH102 
HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



sThrlle 



He ~ 



Vol. — 
-Valval 
-Valval 



B20 

8i»0 
8£»0 
6^0 
?37 
828 
827 
827 
827 



HIVHXB2 HisThrAspAs 

HIVBH102 

HIV8H5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



nGlySerAsnPheThrGlyAlaThrValArgAlQAlQCysTrpTrpAlQ 
Ser Lys 

Sor '"^^ 



-Pro- 



Lys 

— SerThr Lys~ 

— SorThr Lys 

—SerThr Lys 

—SerThr Lys 



Thr 



-Sei Ala Lys 

"Ser Ala Lys 



8^8 
860 
860 
860 
060 
757 

0«i7 
847 
847 



HIVBH102 
HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



He 



Asn- 



868 
880 
880 
080 
880 
777 
868 
067 
067 
867



HIVHXB2 MetAsnLysGluLeuLysLysIlelleGlyGlnValArgAspGlnAlaGluHisLeuLys 088 

HIVBH102 900 

HIVBH5 ■• 900 

HIVPV22 900 

HIVMN 797 

HIVSF2 Asn 888 

HIYRF Gin Gin 887 

HIVELI 887 

HIVHXB2 ThrAlaValGlnMetAlaVQlPhoIleHisAsnPhoLysArgLysGlyGlyllGGlyGly 900 

HIVaH102 920 

HIVBH5 920 

HIVPV22 920 

HIVMN Arg 817 

HIYSF2 908 

HIVRF : 907 

HIVMAL 907 

HIVELI ArgArg 907 

HIVHXB2 TyrSerAloGlyGluArglleValAspIlelleAlaThrAspIleGlnThrLysGluLeu 928 

HIV8H102 9f»0 

HIVBH5 g£j0 

HIVPV22 Qi^o 

HIVBRU 9<fr0 

HIVPON ^^Gly 837 

HIVSF2 928 

HIVRF 927 

HIVMAL lie Met 927 

HIVELI lie 927 

HIVHXB2 GlnLysGlnlleThrLysIleGlnAsnPheArgValTyrTyrArgAspSerArgAsnSer 9A8 

HIVBH102 Pro 960 

HIVPV22 Pro 960 

HIVRF AspPro 947 

HIVMAL Asn AspPro 947 

HIVELI lie AspPro 947 






HIVHXB2 LeuTrpLysGlyProAlaLysLeuLeuTrpLysGlyGluGlyAlaValValIleGlnAsp 968

HIVBH102

HIVBH5
HIVPV22
HIVBRU
HIVMN
HIVSF2

HIVRF His



HIVELI Ile



HIVHXB2 
HIVBH102 
HIVBH5
HIVPV22 



HIVSF2 

HIVRF 

HIVMAL 



HIVELI Lys Val



HIVHXB2 GlnMetAlaGlyAspAspCysValAlaSerArgGlnAspGluAsp+++ 1004

HIVBH102

HIVBH5

HIVPV22

HIVBRU

HIVMN Thr

HIVSF2

HIVMAL —IIII^yGr^IIZZZIIIZIIIIII 

HIVELI 



1016 
1016 
1016 
1016 
913 
1004 
1003 
1003 
1003 



980 
980 
980 
960 
877 
968 



HIVMAL Ile



967 
967 



sor cds start -> 

AsnSerAspIleLysValValProArgArgLysAlaLysIleIleArgAspTyrGlyLys 988



1000 
1000 



HIVBRU

HIVMN Asn 897

988 

987 

987 

987
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TABLE 2 

HIV-1 pol gene HIVHXB2 Sequence

Data from Human Retroviruses and AIDS, 1988 

Los Alamos National Laboratory 

AcNPV-HlVWPlPol Vi rus 

RBS

Bam HI ti
5'-GGATCCTATAAATATG tttttta gggaagatct
pol cds start (NH2-terminus uncertain) ->



2101 ggccttccta caagggaogg ccagggaatt ttcttcogag cagoccagag ccoacagccc
2161 caccogaaga gagcttcagg tctggggtog agocaacoQc tccccctcog Qogcaggagc
2221 cgatogacaa ggaactgtct cctttooctt ccctcaggtc actctttggc oacgacccct
2281 cgtcacQoTA Aagatagggg ggcaactoaa ggaagctcta ttagatacag gagcagatga
<- gag cds end
2341 tacagtotta* goagaaotga gtttgccagg aagatggaao ccaaaaatga togsgggoat
2401 tggaggtttt atcaaagtoa gacagtatga tcogatactc atagaaatct gtggacataa
2461 agctataggt acogtottag taggacctac uCC wg uCOQC otaattggaa gaaatctgtt
2521 gactcagatt ggttgcactt taaattttcc cu^xagccc w attgagactg taccogtoaa
2581 Qttaaagcca ggaotggatg gcccaaaagt taaacaotgg ccattgacag aogaaaaaat
2641 aooagcatta gtagaaattt gtocogagat ggaaoaggao gggaaaottt caaoaottgg
2701 gcctgaaoQt ccatacoota ctccagtott tgccotaaag aaaoaagaca gtactaaatg
2761 gogoaaatto gtagatttca gagaacttaa toagagaact coagacttct gggaogttca
2821 Qttoggaata ccacatcccg cogggttoaa aaagaoaaaa tcogtoacag tactggatgt
2881 gggtgatgca tatttttcag ttcccttoga tgacgocttc oggaogtata ctgcatttac
2941 catacctagt otaaacaatg ogacQCcogg gottogatat cogtocaatg tgcttccaca
3001 gggatggcaa ggatcaccag caatattcca oogtogcatg acoaaaatct togagccttt
3061 togaaoQcaa aatccagaca togttatcta tcoatocotg gatgatttgt Qtgtaggotc
3121 tgocttogaa atagggcagc otagaocaaa ootagoggag ctgagocooc Qtctgttgog
3181 gtggggoctt accacQccag acaaaaaaca tcogoaagaa cctccottcc tttggotggg
3241 ttatgaoctc catcctgata cotggocogt Qcogcctata gtgctgccag Qoaoogacag
3301 ctggoctgtc aatgacatac ogaogttagt ggggaoottg oattgggcao gtcogottta
3361 cccagggott ooagtooggc aattotgtoo QCtccttoga ggaaccooog CQctoacoga
3421 ogtoatacca ctaocogoog Qogcogagct cgaactggca goooacogag ogattctaaa
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Table 2 cont'd 



3481 agoQccogta catggogtgt attatgaccc atcoooogoc ttoatagcog QoatQcogaa

3541 gcoggggcQQ ggceaatgga catatcaaat ttatcoagag ccatttaaaa otctgaaaac

3601 oggaaaotot gcaogaatga ggggtgccca coctootgat gtoaoacoat taacogoggc 

3661 ogtgcaaaQQ otaaccecog aoogcatagt aotatgggga aogactccto DGtHtoaact 

3721 gcccQtacao aaggoaacat gggaaacotg gtggacogag tattggcoag ccacctggot 

3781 tcctgogtgg gogtttgtta otacccctcc cttogtgaaa ttotggtacc ogttagagoa 

3841 ogaacecato gtoggagcag oaaccttcta tgtagotggg gcagctaaca gggogoctGo

3901 ottoggaaaa geoggatotg ttactoatog oggoagocoa oaogtHgtca ccctoactgo 

3961 cQcaacQoat cogaogactg agttocoagc oatttatcta gctttgcogg attcgggott 

4021 ogaagtoaac atogtoocag octcocaata tgcottogga atcottcaog cacaoccaga 

4081 tcaaagtgoa tcagogttag tcootcoaat oatagogcog ttoatoaaaa oggaaaoggt 

4141 ctGtctggcQ tgggtoccog eococaoagg oottggogga oatgaocoag tagotooott 

4201 ogtcogtgct ggootcagga aagtoctott tttogotgaa otogotaogg cccoogatgc 

4261 Qcatgogaao totcocogta attggagogc aotggctogt gottttaocc tgccacctgt

4321 ogtogcoooa goaatogtog ccogctgtga taaotigtcag ctoaooggog oogccotgco 

4381 tggacQogta goctgtogtc caggcototg gcaactogot tgtococott togaaggooo 

4441 ogttatcctg gtogcagttc atgtegccog tggotatota goagcagaag ttattccogc

4501 ogooacaggg coggooocog cotattttct tttaoaotto gcoggaogot ggccagtoao 

4561 oacoatacQt actgocaotg gcogcoattt caccggtgct acggttoggg ccgcctgttg 

4621 gtgggcgggo Gtcoogcagg aGtttggoot tccctocaot ccccoGogtc ooggogtogt 

4681 ogoatctotg Qatoaagaat toaogoaaot totGggocGg gtoogogBtc GggctgoGCG 

4741 tcttoagaco gcGgtacaaa tggcGgtatt cotccocaot tttooQogoo oaggggggat 

4801 tggggggtac ogtgcogggg ooagoatogt Ggacotoata gcaGcogoca tacooactoa 

4861 GgGGttocGQ aaocQQQtta cooooattcc ooattttcgg gtttottacA GggocagcGg 

/\ 3'sj. 

4921 ooQttccctt tggoGoggoc cogcGoogct cctctggGOG gGTgGogggg cogtogtGot 

5'sj /\

4981 oeoogatoct ogtgGCOtoo Gogtagtgcc oagaogoooG gcooogotco ttogggottA 

sor 23 kD cds start ->

5041 TCgooooccg otggcoggtg ctgo^tgtgt ggcoGgtGgo coggotOGGG o^r.^ ^... 

Crossover <- pol end

linker Sequence 
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TABLE 3

HIV-1 pol gene
HIVHXB2 Sequence 

Data from Human Retroviruses and AIDS, 1988 
Los Alamos National Laboratory 

AcNPV'HIVWHpoI V irus 

RBS

Bam HI ti (AcNPV-HIVWUpol start)

5'-GGATCCTATAAATATG tttttta gggaagatct
pol cds start (NH2-terminus uncertain) ->



2101 ggccttccta coagggaagg ccogggoctt ttcttcagag cagoccagog ccaacagccc
2161 CQCcagaoga gagcttcagg tctggggtag agocaocoac tccccctcog GQgcaggogc
2221 cgatagocaa ggoactgtot cctttaoctt ccctcoggtc octctttggc aacgocccct
2281 cgtcacQoTA Aagotogggg ggcaactaaa ggaagctcto ttagatacog gagcagatga


AcNPV-HIVYKpol Virus

RBS
Bam HI ti (AcNPV-HIVYKpol start)
5'-GGATCCTATAAATATG


2341 tocogtattot gaagaaatga gtttgccagg aagotggaaa ccoaaaatgo tagggggaat
2401 tggaggtttt atcaaogtoa gacagtatga tcagotoctc otagaaatct gtggocotGQ
2461 cgctQtaggt Qcagtattag taggacctac occtgtcoac ataottggaa gaaatctgtt
2521 gactcogatt ggttgcactt taoottttcc cattagccct attgagactg taccogtoaa
2581 cttoaagcca ggoatggatg gcccaaaogt tooocaatgg ccattgacag GogQaaaaat
2641 ooaagcatta gtagaaattt gtocagogat ggaaoaggoa gggaooattt coQoaQttgg
2701 gcctgooaat ccotocaota ctccogtott tgccotooag oaaaoagoca gtactaoQtg
2761 gagoaaotta gtogatttca gogoacttaa taagogoact coagocttct gggaogttca
2821 Qttoggaata ccacatcccg cagggttaaa aaogaaaaaa tcagtaocog toctggatgt
2881 ggg'tgotgca tatttttcog ttcccttoga tgaagacttc aggaogtato CtgCGtttGC
2941 catacctogt GtQoacaatg ogacaccagg gottagatat cagtocaatg tgCttCCGCQ
3001 gggotggaaa ggotcGccag caatottcca aogtogcotg Qcoaaootct tagogccttt
3061 tagoaaacaa ootccogocQ togttotcta tcaatQcatg gotgatttgt otgtoggatc
3121 tgacttogoQ atagggcagc atogaocoao OGtogaggog ctgagacooc Gtctgttgag
3181 gtggggactt occQcaeecg ocoooQoaca tcogaaogaa cctccottcc tttggotggg
3241 ttotgoactc cotcctgato oatggacagt acogectota gtgctgccag OGaaogGcog
3301 ctggoctgtc oatgocotQc ogaogttagt ggggooattg OGttgggcoa gtcogottto
3361 cccagggart OQogtooggc oottatgtoQ Qctccttaga ggaaccQGog CQCtoacaga
3421 ogtootocca ctaocogoog oogccgogct agooctggca gaaaocogag ogottctOGo
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Table 3 cont'd 

3481 ogocccogta cotggagtgt ottotgoccc ctcoaoagoc ttootogcog caotocagco
3541 gccggggcaa ggcccatgga cototcaoot tftcocgag ccottfooa otctgooocc 
3601 oggocaatot gcaogootga ggggtgccca coctaotgot gtooaocoat t:aacogaggc
3661 ogtgcaocao ctaoccacog cocgcatogt oototgggga aogcctccto catttaocct: 
3721 gcccotocoo ocggooccot gggocacotg gtggccogog tcttggcaog ccocctggot 
3781 tcctgogtgg gogtttgtto otacccctcc cttagtgooo ttotggtocc ogttogogco 
3841 ogooccccto g^oggagcog ooaccttc^o tgtogotggg gcogctooco gggogoc^oa 
3901 cttoggooao gccggototg ttoctaotog oggaogocoo ooogttgtco ccctooctgo 
3961 cocoocaoot cogoogoc.g og.tccoogc oatttotcto gctttgccgg ct.cgggott 
4021 cgocgtoocc otcgtcocag octcocooto tgcot^ogga otcottcoog cocoaccaga 
4081 tcaoogtgoa tcogogttog tccotcoaat ootogogcog ttaotaoaoc oggooooggt 
4141 ctatctggca l:gggtoccog cococoaagg oottggoggc eatgaocoog togatoaatt 
4201 cgtcogt:gct ggootccgga oogtoctott t:t:tcgatgga otagotcogg cccaogc.g. 
4261 ccotgogooo totcocogta cttggogcgc oatggotcgt gottttoocc tgccocctgt 
4321 cgtogcoooa gocotogtog cccgctgtgo tocatgtcag ctooooggog acgccotgco 
4381 tggocoagta goctgtcgtc coggoototg gcccctogat tgtococott togocggooo
4441 cg..atcctg gtcgccgttc otgtogcccg tggaftato googcogaog ttattccogc
4501 ogoaacoggg coggaaocog catottttct tttooaatto gcoggoogat ggccogtoco 
4561 cocoal:acat cctgoccotg gccgcoottt coccggtgct ocggttoggg ccgcctgt^g 
4621 gtgggcgggo ctcoogccgg ootttggaat tccctoccat ccccaoogtc ooggogtogt 
4681 cgoctctotg catooogoct tcocgoocot to.oggocog gtaogogctc oggctgooco 
4741 tcttaogoco gcogtccooo tggcogtott cctcccccot tttocoogoc coggggggot 
4801 tggggggtac agtgcagggg oaogoctogt ogacotoata gcoocogaco toccooctoc 
4861 ogoottacao aaocaootto coooocttco ooottttcgg gtttottccA Gggccagcog 

/\ 3'sj 

4921 oo«tcoctt tggco«ggoc cogcoocgct cctctggooo gCTgoogggg cogtogtcot 

/\ 
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Table 3 cont'd 



4981 acaogataat agtgacotoa oagtagtgcc aagaagoaoa gcoaagatca ttogggattA

Bam HI 

5041 TGgaaaocag otggcaggtg atgottgtgt ggcoagtaga caggatgagg otTAGGATCC 

-Sph I 

GCATG-3'
<- pol end



5101 ggaaaagttt agtaaoacoc ca'tatgtatig tttcagggoa ogctogggga tggttttota
5161 gacotcGcta tgaaagccct: cQ'tccaagaa 'taagttcaga ogtococQtc ccQctogggg
5221 Qtgctagatt ggtaatOQca Qcotattggg gtctgcQtoc aggogaaaga gactggcatt:
5281 tgggtcoggg cgtctccQta goatggagga aoaogaga'ta tagcacQcaa gtagoccctig
5341 OQctagcaga ccaac'toa't't catctgtat-t octttgactg "tttttcAGac tctgcta-tOQ
/\ 3'sj
5401 gaaaggcc't't attaggacac Qtagt-tagcc ctoggtgtga QtQtcaagcQ ggocataacQ
5461 ogGTaggotc tctacaatac ttggcactog cagcattoat QQCQCcaaaa aagataaogc
5'sj /\
5521 cacctttgcc togtgttacg aaactgacog aggotagATG goacaagccc cogaagaccQ
R orf cds start ->

5581 agggccacag agggagccoc acaatgaatg gococTAGag cttttogagg agcttaagaa 

<- sor 23 kD cds end 

5641 tgaagctgtt ogacattttc ctaggatttg gctccatggc ttogggcaac atatctatga 

5701 QQCttatggg gotacttggg caggogtgga ogccatoato ogaattctgc aacaactgct 

5761 gtttatccat tttcAGaatt gggtgtcgac oTAGcogaat oggcgttact cgacogogga 

/\ 3'sj <- R orf cds end 

5821 gagcaagaaA TGgagccagt ogotcctaga ctagogccct ggaogcatcc oggaogtcog 
tat cds start ->

5881 cctaaaactg cttgtaccoa ttgctattgt aaaaagtgtt gctttcattg ccaagtttgt 

5941 ttcataacaa oogccttogg cotctcctAT GgcAGgooga agcggogaco gcgacgooga 
trs/art cds start -> /\ 3'sj

6001 gctcatcaga acagtcogac tcatcaagct tctctatcaa ogcaGToagt ogtacotgta 

(tat, trs/art, 27 kD) 5'sj /\

6061 AcGcaaccta toccaotcgt ogcoatagto gcattogtog togcoatoot oatagcaata 
U orf ->

6121 gttgtgtggt ccatogtoot cotagoatat oggaaaatot toogocoaag ooaaotagoc 
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CLAIMS: 

1. The use of a polypeptide as a reagent in a
diagnostic test for HIV infection or as a vaccine against
HIV infection characterized in that said polypeptide
comprises a substantial portion of each of more than one of
the enzymes coded for by the HIV-pol gene.

2. The use claimed in Claim 1 characterized in that
said polypeptide comprises a plurality of enzymes selected
from HIV-pol protease, HIV-pol reverse transcriptase,
HIV-pol RNase H and HIV-pol integrase.

3. The use claimed in Claim 2 characterized in that 
said polypeptide comprises substantial portions of all four 
of said enzymes. 

4. The use claimed in Claim 2 characterized in that
said polypeptide omits at least that part of the amino acid
sequence of the HIV-pol protease gene which codes for the
active site responsible for proteolytic activity.

5. The use claimed in Claim 3 characterized in that said
polypeptide omits at least that part of the amino acid
sequence of the HIV-pol protease gene which codes for the
active site responsible for proteolytic activity.

6. A diagnostic kit for detecting antibodies to HIV
antigens characterized in that said kit contains, as a test
reagent, a polypeptide as defined in Claim 1, Claim 2,
Claim 3, Claim 4 or Claim 5.

7. A vaccine for protecting an individual against HIV
infection comprising a polypeptide and a pharmaceutically
acceptable carrier, characterized in that said polypeptide
is as claimed in Claim 1, Claim 2, Claim 3, Claim 4 or
Claim 5.

8. A polypeptide comprising a substantial portion of
each of more than one of the enzymes coded for by the
HIV-pol gene characterized by omitting at least that part of
the amino acid sequence of the HIV-pol protease gene which
codes for the active site responsible for proteolytic
activity.
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9. A polypeptide as claimed in Claim 8 characterized by 

comprising sequences of a plurality of enzymes selected from 
HIV-pol protease, HIV-pol reverse transcriptase, HIV-pol 
RNase H and HIV-pol Integrase. 
10. A polypeptide according to Claim 9 characterized
in that said polypeptide contains substantial portions of
all four of said enzymes.

11. A polypeptide according to Claim 8 characterized in
that said polypeptide has an amino acid sequence
substantially as shown in Table 3 beginning with the amino
acid Met marked "AcNPV-HIVYKpol starts".
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